Semantic Segmentation, Context Windows, Document Boundaries, Retrieval Units

Feeds to Scour
SubscribedAll
Scoured 15532 posts in 897.1 ms
A Systematic Analysis of Chunking Strategies for Reliable Question Answering
arxiv.org·1d
📄Semantic Chunking
Preview
Report Post
How Search Engines and AI Systems Extract Answers From Structured Content
dev.to·2h·
Discuss: DEV
📊Search Ranking
Preview
Report Post
GutenOCR: A Grounded Vision-Language Front-End for Documents
arxiv.org·4h
🤖Advanced OCR
Preview
Report Post
Exploring Text Compression
denvaar.dev·1d
📝Text Compression
Preview
Report Post
Inside Mixedbread: How We Built Multimodal Late-Interaction at Billion Scale
mixedbread.com·2d·
Discuss: Hacker News
🗂️Vector Search
Preview
Report Post
Personal Knowledge Management
reseek.net·2h
🧠Personal Knowledge Base
Preview
Report Post
Redacting Faces, People, Vehicles, and Plates with Amped Replay Assisted Redaction
blog.ampedsoftware.com·18h
🧪Archive Fuzzing
Preview
Report Post
You Probably Don’t Need a Vector Database for Your RAG — Yet
towardsdatascience.com·1d
🗂️Vector Databases
Preview
Report Post
Building an Intelligent Web Document Scanner with OCR and Chrome's Built-in AI
dev.to·7h·
Discuss: DEV
📄Document Streaming
Preview
Report Post
Everything Open 2026 – Day 2
blog.darkmere.gen.nz·10h
🔍Archive Search
Preview
Report Post
Introducing multimodal retrieval for Amazon Bedrock Knowledge Bases
aws.amazon.com·1d
🌀Brotli Dictionary
Preview
Report Post
Databases are magic ... until ...
silvestreperret.com·1d·
Discuss: Hacker News
🗄️Database Internals
Preview
Report Post
From Retrieval to Relevance: 5 Reranker Types Defining Modern Search Systems
pub.towardsai.net
·1d
🎯Retrieval Systems
Preview
Report Post
The Silent AI Breach: How Data Escapes in Fragments
hackernoon.com·13h
🔓Hacking
Preview
Report Post
Patterns All the Way Down: A Generalization for Graph-Like Things
medium.com·17h·
Discuss: Hacker News
🤝Unification Algorithms
Preview
Report Post
Show HN: We built an OCR API to stop babysitting extraction pipelines
news.ycombinator.com·1d·
Discuss: Hacker News
👁️Constructive OCR
Preview
Report Post
featurestorebook/mlfs-book: O'Reilly book - Building Machine Learning Systems with a feature store: batch, real-time, and LLMs
github.com·8h·
Discuss: Hacker News
🧠Machine Learning
Preview
Report Post
Explainer: Tree-sitter vs. LSP
lambdaland.org·1d
🌳Context free grammars
Preview
Report Post
Everything Moe
ianbarber.blog·1d·
Discuss: Hacker News
🧠Learned Compression
Preview
Report Post
Two workflow challenges from 2012 that were solved as a byproduct of something else
statmodeling.stat.columbia.edu·1d
Proof Automation
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help